Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program

نویسنده

  • Alan R. Aronson
چکیده

The UMLS Metathesaurus, the largest thesaurus in the biomedical domain, provides a representation of biomedical knowledge consisting of concepts classified by semantic type and both hierarchical and non-hierarchical relationships among the concepts. This knowledge has proved useful for many applications including decision support systems, management of patient records, information retrieval (IR) and data mining. Gaining effective access to the knowledge is critical to the success of these applications. This paper describes MetaMap, a program developed at the National Library of Medicine (NLM) to map biomedical text to the Metathesaurus or, equivalently, to discover Metathesaurus concepts referred to in text. MetaMap uses a knowledge intensive approach based on symbolic, natural language processing (NLP) and computational linguistic techniques. Besides being applied for both IR and data mining applications, MetaMap is one of the foundations of NLM's Indexing Initiative System which is being applied to both semi-automatic and fully automatic indexing of the biomedical literature at the library.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Summarization of Biomedical Documents Using Word Sense Disambiguation

We describe a concept-based summarization system for biomedical documents and show that its performance can be improved using Word Sense Disambiguation. The system represents the documents as graphs formed from concepts and relations from the UMLS. A degree-based clustering algorithm is applied to these graphs to discover different themes or topics within the document. To create the graphs, the...

متن کامل

Biomedical term normalization of EHRs with UMLS

This paper presents a novel prototype for biomedical term normalization of electronic health record excerpts with the Unified Medical Language System (UMLS) Metathesaurus. Despite being multilingual and cross-lingual by design, we first focus on processing clinical text in Spanish because there is no existing tool for this language and for this specific purpose. The tool is based on Apache Luce...

متن کامل

Comparison of LVG and MetaMap Functionality

LVG and MetaMap both compute lexical variants but were developed for quite different purposes: LVG’s raison d’être is lexical variant generation whereas MetaMap’s main purpose is to map text to corresponding concepts in the UMLS® Metathesaurus (Meta), one of the UMLS knowledge sources. Besides generating lexical variants, LVG has the subsumed ability to normalize words and the supplementary abi...

متن کامل

The effect of textual variation on concept based information retrieval.

Accounting for textual variation in the documents and queries processed by information retrieval systems is considered essential for achieving good retrieval. Recent research has called into question several of the techniques used to support this endeavor. This paper reports on experiments with a concept based information retrieval system which relies on a program called MetaMap to account for ...

متن کامل

طرح ادغام سرشاخه خوشه طب سنتی ایران در ساختار ابَراصطلاحنامه « نظام زبان واحد پزشکی (UMLS)»

Background & Aim: Unified Medical Language System (UMLS) is an extensive ontology of biomedical knowledge developed and maintained by U.S. National Library of Medicine (NLM). Traditional Iranian Medicine (TIM) does not have any position in the structure of metathesaurus of UMLS. The main aim of this study was designing a scheme of TIM cluster's crotch mapping in the structure of metathesaurus o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings. AMIA Symposium

دوره   شماره 

صفحات  -

تاریخ انتشار 2001